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ABSTRACT 

Recent calculations using non-linear relativistic cosmological perturbation theory show 
biases in the mean luminosity distance and distance modulus at low redshift. We show 
that these effects may be understood very simply as a non-relativistic, and purely kine¬ 
matic, Malmquist-like bias, and we describe how the effect changes if one averages over 
sources that are limited by apparent magnitude. This effect is essentially identical to 
the distance bias from small-scale random velocities that has previously been con¬ 
sidered by astronomers, though we find that the standard formula overestimates the 
homogeneous bias by a factor 2. 

Key words: Cosmology: theory, observations, distance scale, large-scale structure; 
galaxies: distances and redshifts 


1 INTRODUCTION 

It is well known that the local rate of expansion Ho is sig¬ 
nificantly perturbed, at linear order, by peculiar velocities 
associated with the growth of density perturbations. The im¬ 
pact of this on cosmological parameter estimation is quan¬ 
tified theoretically by calculating the covariance of the 1st 
order velocity field which is given in terms of the power spec¬ 
trum of density fluctuations (Hui & Greene 2006; Cooray & 
Caldwell 2006; Davis et al. 2011; Kaiser & Hudson 2014). 

The subject of this paper, in contrast, is the systematic 
bias in distances, and therefore Ho, caused by velocities, and 
which is a second order effect. This has been studied using 
2nd order relativistic cosmological perturbation theory in a 
number of recent papers (Vanderveld, Flanagan & Wasser- 
man, 2007; Li & Schwarz 2008; Clarkson, Ananda & Larena 
2009; Umeh, Larena & Clarkson 2011; Gasperini et al. 2011; 
Wiegand & Schwarz 2012; Fanizza et al. 2013; Ben-Dayan 
et al. 2012a, 2012b, 2013a, 2013b, 2014). 

These papers all compute the deviation of quantities 
such as the mean luminosity distance and distance modulus 
(log distance), averaged over a surface of constant redshift, 
from that which would apply in a homogeneous universe. 
Second order perturbation theory is being used in order 
to explore the regime of non-linear gravitational dynamics. 
Most of these papers describe the effect as backreaction from 
the formation of structure, though the term may be being 
used in a relatively broad sense compared to the narrow 
definition as the effect of non-commutativity of spatial av¬ 
eraging and time evolution deriving from the non-linearity 
of Einstein’s equations. 

Quantitative predictions in the context of conventional 
structure formation models are provided in e.g. figure 6 of 


Ben-Dayan et al. 2013b which shows that the bias falls off 
inversely as the square of the redshift; that the fractional 
perturbation to the mean distance 8d = (5<1 l)/ d,L is positive, 
and that the perturbation to the mean flux density 4> is 
negative with 5$ = (<5<!>)/ < f? ~ —0.55^. Further, according to 
Ben-Dayan et al. 2014 (hereafter BDMS14), for low redshift 
zCl the mean flux density perturbation is given in terms of 
( v 2 ), the total variance of the first order line-of-sight peculiar 
velocity, by 5$ = —(v 2 )/c 2 z 2 , and they give the bias in the 
distance modulus p = 5 log dr, = (5/ In 10) Indz, as (<5/z) = 
(7.5/ In 10)(u 2 )/e 2 z 2 . 

There are two surprising features of these results if 
they are assumed to be caused by inhomogeneity affecting 
the evolution of the averaged universe. First, a cosmologi¬ 
cal effect would be expected to grow with increasing red¬ 
shift rather than decrease. Second, one would expect per¬ 
turbations to distance, distance modulus and flux density 
to be related by (8p) = (5/In 10)(5dz,)/dz, and /<E> = 
— 2{8(Il)/ d,L, just as for an individual ‘standard candle’. The 
relations between these quantities obtained from perturba¬ 
tion theory are quite different, and suggest that the cause of 
these effects are fluctuations. In that case, the usual relations 
for a standard candle would not apply, simply because of the 
non-commutativity of averaging and non-linear transforma¬ 
tions; the mean of the square of a fluctuating quantity, for 
example, is of course not the same as the square of the mean. 
The effect of fluctuations and the non-linearity of the rela¬ 
tionships between dr,, p and 4> was discussed by BDMS14 
who noted that the bias in Ho depends on the observable 
used, and by Ben-Dayan et al. 2013a, who argued for using 
the flux density <f> in Ho measurements, claiming this to be 
the least sensitive to fluctuations. 

Statistical biases in distance estimation, often asso- 
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ciated with the names Eddington (1914) and Malmquist 
(1920), have been known and widely studied for a long 
time, in the context of both cosmological parameter esti¬ 
mation and measurements of large-scale peculiar motions 
or ‘cosmic flows’. Substantial biases may result from the 
typically ~ 20% uncertainty in luminosity distance estima¬ 
tors for galaxies such as are obtained from the Tully-Fisher 
(TF) relation for spirals (Tully & Fisher 1977) and from 
the ‘fundamental plane’ (FP) for elliptical galaxies (Djor- 
govski & Davis, 1987; Dressier et al. 1987). In particular, 
distance estimates to galaxies may suffer so-called ‘homoge¬ 
neous Malmquist bias’ in that field galaxies in some range of 
estimated distance will tend to have true distances that are, 
on average, systematically enhanced as more galaxies are 
scattered inward from larger distances than outward from 
smaller distances (see Lynden-Bell et al. 1988; Willick 1994; 
and the reviews of Faber et al. 1994 and Strauss & Willick 
1995 for more details). Lynden-Bell et al. 1988 showed that 
with a log-normal model for the distribution of distance er¬ 
rors the mean log-distance in a spatially homogeneous uni¬ 
verse would be biased upward by 51nd = 3A 2 where A 2 is 
the fractional distance error variance. 

This particular kind of bias may be avoided by con¬ 
sidering the mean peculiar displacement in redshift-space 
(where neighbouring sources have, to a good approxima¬ 
tion, the same distance) rather than the peculiar motion 
in estimated distance space (Schechter 1980). This bias is 
also not particularly relevant to the calculations above as 
they effectively assume perfect standard candles. What is 
relevant is the residual bias that persists after the bias from 
distance errors has been eliminated. This is driven by small- 
scale velocity dispersion which causes a scatter in the true 
distance for objects at the same redshift. This was first con¬ 
sidered by Lynden-Bell (1992) who calculated the shift in 
the mode of the distribution of log-distances for objects of 
a given recession velocity. Specialising to uniform density 
and ignoring selection effects and streaming motions gives 
8 In d = 3cr 2 /c 2 2 2 where a v is the velocity dispersion. Willick 
et al. 1997 also found, under the same simplifying assump¬ 
tions, that velocity dispersion induces a bias in the apparent 
magnitude (or distance modulus) of sources of given redshift 
of 8m = 3 x (5/ In 10) x a 2 /c 2 2 2 . And both of these are just 
what one would expect from the Lynden-Bell et al. 1988 
formula for the standard Malmquist effect with fractional 
distance error variance A 2 = cr 2 /c 2 2 2 , which seems very 
reasonable. 

Lynden-Bell (1992) and Willick et al. (1997) consid¬ 
ered the effect of motions on small-scales that are modelled 
as spatially incoherent with galaxies behaving like a gas of 
particles with a Maxwellian velocity distribution. This is 
very different from the modelling of velocities in perturba¬ 
tion theory, where the motion is like that of a smooth, cold 
fluid. But otherwise the results are qualitatively the same 
in that the bias falls off as 1 /z 2 and is proportional to the 
mean square velocity. This might lead one to suspect that 
the perturbation theory results are simply the analogue of 
Malmquist-like bias from small scale motions; which are en¬ 
tirely a consequence of kinematics and statistics. On closer 
inspection, however, there is a difference in that 8m is twice 
as large as the 8/.i of Ben-Dayan et al. 2014. 

In this paper we will explore these biases further. The 
questions we address are: To what extend can the perturba¬ 


tion theory results be understood in terms of kinematics and 
statistics? Why does there appear to be a difference between 
the effects of perturbative flows and small-scale incoherent 
motions? Is this some subtle relativistic effect? Or might it 
perhaps derive from some significant difference between the 
statistical properties of small-scale and large-scale motions? 
or from the neglect of density perturbations associated with 
the latter? Another question is why the perturbation the¬ 
ory analysis result for the bias on local measurements of Ho 
is determined by the total velocity variance, including that 
from very long wavelength perturbations, when one would 
expect only relative motions - which for super-survey scale 
modes are suppressed - to appear. 


2 MALMQUIST BIAS FROM LARGE-SCALE 

COHERENT FLOWS 

Here we will calculate the kinematic bias arising from ‘co¬ 
herent flows’ or ‘streaming motions’; these being the focus 
of the relativistic perturbation theory calculations. We con¬ 
sider small-scale ‘thermal’ motions later. Since we are in¬ 
terested in the low redshift regime 2 <C 1 we work in flat, 
empty space and, we will also ignore special relativistic ef¬ 
fects as the effects of interest here are generally of order 
~ ( v/cz ) 2 > (v/c) 2 . 

We first consider the bias in the distance and related 
quantities when averaged over the surface of constant red¬ 
shift as this is simple, illustrates the key features of the phe¬ 
nomenon, and is what was considered in the relativistic per¬ 
turbation theory studies. We then generalise the analysis to 
the more realistic case where we average these quantities 
over sources. 

2.1 Area Averaged Bias 

We imagine an ensemble of realisations of a smooth field 
of test particles that have a spatially continuous velocity 
field that consists of a Hubble flow Hr plus a statistically 
homogeneous random velocity perturbation field, and where 
one particle is selected at random as the observer and is 
taken to lie at the origin of spatial coordinate system. Let 
the velocity with respect to this observer be u(r) and define 
the peculiar velocity v = u — Hr. Let us further assume, 
in the spirit of perturbation analysis, that the amplitude 
and scale length for perturbations in the peculiar velocity 
are such that there is a unique mapping from velocity (or 
redshift) space to real space; i.e. all particles in some region 
of redshift space have the same peculiar velocity. Working in 
units such that both the speed of light c and the expansion 
rate H are unity, the distance is d = |r| = 2 — v where v is 
the line-of-sight component of the peculiar velocity. 

Consider a cone of infinitesimal solid angle dQ. In red¬ 
shift space, the intersection of this cone and a constant -2 
surface has area dA z = z 2 dfl. That two dimensional surface 
maps to surface element in real space that will lie at a per¬ 
turbed distance d = z — v = 2(1 — v/z) and which will, in 
general, be slightly tilted relative to the line of sight as there 
will, in general, be some gradient of v transverse to the line 
of sight V±v. The surface element area in real space is then 

dA r = (l-v/z) 2 (l + \V ± v\ 2 /2)dA z . (1) 
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The average of the fractional perturbation to the dis¬ 
tance 8d = (d — z)/z = —v/z over a solid angle Afl, weighted 
by real-space area, is then 


f dn(l - v/z) 2 ( 1 + \V ± v\ 2 /2)(-v/z) 
d /dft(l-r7z) 2 (l + |Vx«| 2 /2) 


( 2 ) 


We wish to evaluate 8d accurate to second order in ve¬ 
locities. Since there is a factor v/z in the numerator, that 
means we need only keep first order terms in the denomina¬ 
tor, and we can completely ignore the transverse derivative 
terms as they appear only at third order, to give 



the last factor here allowing for correlation between the nu¬ 
merator and denominator in ©• 

The integrals here are evaluated on the surface z = 
constant i.e. on the perturbed surface in real space d = z — 
v. Working to second order precision, 5d is given in terms 
of quantities on the constant distance surface d = z using 
v(z — v) = v(d = z) — vdv/dd + ... = v — (l/2)dv 2 /dz + ... 
(we can ignore the effect on the second order terms above 
as the change in these is third order in v). 

Taking the ensemble average, which we will denote by 
(...}, the expectation value of the first order term here van¬ 
ishes as the velocity is equally likely to be positive as neg¬ 
ative - this is equally true in real-space and redshift-space 
since, like v(d), dv 2 /dz is equally likely to be positive or 
negative - with the result 


(<5d) * 


2{v 2 ) 


r d,n r 

J a nj 


dn' 

AQ. 


(vv 1 ) 


1 r dQ r art .. /\ 2 \ 

= *] An J Kn {{v - v)) - 


(4) 


The last expression above makes it clear that (5d) > 0 
so the mean distance is biased upwards. It also shows that, 
for an averaging area that subtends a small solid angle 
AS2 -C 1, only velocities caused by density perturbations 
with scale comparable to or smaller than the averaging re¬ 
gion contribute significantly to the bias; for perturbations 
much larger than the averaging region size the velocity will 
vary little within the area so v' ~ v and the bias is strongly 
suppressed. 

If instead of the perturbation to the distance, which 
is linear in v (for given z), we calculate the perturbation 
to some observable X that is a non-linear function of dis¬ 
tance like the flux-density or the distance modulus then we 
need to include the second order term in the expansion of 
X expressed as a function of v/z. If the perturbation is 
5X = av/z + bv 2 / z 2 + ... then we simply replace the factor 
(—v/z) in © by av/z + bv 2 /z 2 and performing the same 
expansion - dropping terms that are cubic or higher in the 
velocity - and ensemble averaging that led to © and then 
to a now gives 


(8X) = (-2a + b)¥± + ^ 



(5) 


We can use this to give the fractional perturbation to 
the flux density of standard sources. These have 4>(d) oc 1/d 2 
so 4>(d) = $(z)(l — v/z)~ 2 and 5<s> = (4>(d) — $(z))/$(z) = 
(l — v/z)~ 2 — l = 2v/z + 3v 2 /z 2 +... so the ensemble average 


of the area averaged flux density perturbation 5$ is given by 
© with a = 2, b = 3 or 


<*•> = - 


(v 2 ) 


r do. r 

J a TiJ 


dn 1 

All 


(vv'). 


( 6 ) 


Similarly, the perturbation to the distance modulus 
(DM) ^ = 51og 10 d is Sfj, = aln(l — v/z) = —a(v/z + 
v 2 /2z 2 + •. •) with a = 5/ In 10 ~ 2.17, so the ensemble av¬ 
erage of the area average of is given by © with a = —a, 
b = —a/2 or 


(<W = a 


~ 3(u 2 ) 

2z 2 



(7) 


Note that in both of these cases, in contrast to ©. there 
is not complete suppression of the effect of perturbations on 
scales larger than the averaging area. 

If we take the averaging area to cover the entire sky, and 
assume that the redshift is sufficiently large that the distance 
to this shell is much greater than the coherence scale for the 
velocity fluctuations then the second term involving (vv') in 
each of equations mm&z® will be much smaller than the 
first term and we have 

(So) = 2 (v 2 )/z 2 

(5$) = ~(v 2 )/z 2 (8) 

<^> = (7.5/ln(10))(u 2 )/* 2 . 


These are identical to the low -2 limit expressions of 
BDMS14. So the relatively large low -2 effects are not in 
an essential way a result of non-linearity of gravitational 
dynamics (relativistic or Newtonian) as they are fully ac¬ 
counted for by kinematics and statistics. We believe, of 
course, that the velocities we observe are really caused by 
gravity, and non-linear structure is involved, but our point 
here is that the same bias would be found if one were observ¬ 
ing test particles of negligible mass with peculiar motions 
caused by non-gravitational forces. 

These Malmquist-like biases are easy to understand. 
The perturbation to the mean distance, for example, comes 
about because even though the velocity field on a sphere 
of constant -2 is equally likely to be positive or negative, so 
as many areas (or solid angle elements at the observer) get 
pushed out as get pushed in in distance-space, those that 
get pushed out to larger d get pushed in the radial direction 
and so get expanded in area by a factor (1 — v/z) 2 ~ 1 — 2 v/z 
(see figure©. Similarly those that get displaced inwards get 
compressed. The result is a rectification of the real-space 
area averaged distance. The different numerical factors for 
the other variables comes about simply because they are 
non-linear functions of the distance. 

BDMS14 noted that the above imply that the bias in 
Ho obtained from the area-averaged flux density is a factor 3 
lower than that obtained from averaging the distance mod¬ 
ulus. The above analysis shows that one can do even better 
by averaging K i ^ 2 oc (1 — v/z)~ 3 since this gives a — 3 and 
6 = 6 so—2o+ 6 = 0 and, in the approximation that the 
depth is greater than the coherence scale used to obtain ©, 
the bias vanishes. 

We would emphasise that, according to our analysis, the 
simple results © are only valid for velocity perturbations 
with coherence scale less than the distance. But at the same 
time the effects are really only significant at low redshift be¬ 
cause of the I/ 2 2 scaling. For realistic power spectra there is 


© 0000 RAS, MNRAS 000, 000-000 







4 Kaiser & Hudson 



A 


Figure 1. Dotted lines are lines of longitude and latitude on the 
surface of constant redshift. On this surface, peculiar velocities 
are equally likely to be positive as negative. The cone illustrates 
how a section of this sphere maps to real space for the case of 
a negative peculiar velocity. The section is pushed out radially 
away from the observer - who resides at the centre of the sphere 
- and consequently is expanded in area. Similarly, for a positive 
peculiar velocity the section would be compressed. The result of 
this is that the average of the distance, when weighted by real- 
space area, is positive. This is the cause of the bias found in the 
relativistic perturbation theory analyses. More relevant to real ob¬ 
servations is the bias in distance averaged over the sources that 
lie in a shell of given redshift. We consider this in £12.21 There 
we find that there are some relatively minor differences that arise 
from the clustering of sources and from the Jacobian involved in 
transforming volumes from redshift to real space, but the main 
difference is that the generalisations of m have different numeri¬ 
cal pre-factors when the sources are subject to selection based on 
flux density. 


significant contribution to the velocity variance from quite 
large scales; certainly extending to tens if not hundreds of 
Mpc, so except for observations at much greater distance 
- where the effects rapidly become uninterestingly small - 
one should not use these formulae with the total velocity 
variance computed in the usual way from the matter power 
spectrum rather one should use equations|4j[6]&[7]that incor¬ 
porate the terms involving the velocity correlation function 
(vv 1 ). 

It is also important to realise that we have defined the 
peculiar velocity here such that the velocity of the observer 
vanishes. Thus the variances and co-variances in these equa¬ 
tions are of velocities relative to the observer, which in prac¬ 
tice is usually taken to mean relative to the velocity of the 
local group (LG), since it is the LG peculiar velocity, unlike 
the motion of the earth or the sun, that is thought to best 
reflect the gravitational acceleration from large scale struc¬ 
tures. This eliminates the effect of perturbations on scales 
much greater than the survey depth which would otherwise 


give unphysical effects if the total velocity variance were 
used. 

This is somewhat at odds with BDMS14, and deserves 
some clarification. Their equations 5,6 give a bias that de¬ 
pends on the total velocity dispersion, including a contribu¬ 
tion that comes from modes which are larger than the survey 
scale spanned by the target objects (in their case Ho calibra¬ 
tors). This is the dispersion of one component of the velocity 
of a galaxy relative to the ‘cosmic-frame’, as is thought to 
be well approximated by the frame in which the CMB has 
zero dipole (since any intrinsic dipole is usually thought to 
be very small). In their discussion of this BDMS14 say that 
they remove the motion of the observer since the observa¬ 
tions are usually quoted in the CMB frame, corresponding 
to vo = 0, and that a non-vanishing observer velocity would 
nearly double the effect. This doubling seems to us to be 
misleading. The observer velocity is not zero in the CMB 
frame - the LG is moving at about 600 km/s in that frame 
- but the CMB frame is not of much relevance here as the re¬ 
sults should be independent of any frame that the observers 
choose to refer the observations to. Our formulae, including 
the correlation function (vv'), refer to ensemble averages 
and, if one had no idea how the LG motion originated, then 
these should be in the LG frame. Working in the LG frame 
would indeed increase the co-variance from perturbations 
on scales smaller than the survey scale, though the effect of 
motions on larger scales would still be suppressed. 

But there is a difference between the variance of the 
motions of different source regions and our motion, which 
has a variance in an ensemble sense, but we only sample 
one realisation of the ensemble (though it is a realisation 
of all three components of the velocity, not just one). The 
exact impact of the LG’s motion depends on the depth of 
the gravitational sources that are responsible for its motion: 
if these sources are deeper then the Ho secondary calibra¬ 
tors themselves, then the Ho calibrators and the LG motion 
share the same bulk velocity and so, by operating in the LG 
frame, these super-survey modes disappear, as noted above. 
If on the other hand, the source of the LG’s motion is very 
local to the LG itself (for example, a very nearby attractor 
such as Virgo), then, when operating in the LG frame, the 
LG motion induces a coherent dipole pattern (see Kaiser & 
Hudson 2014 and references therein). This coherent dipole 
is different in character to the less-coherent distortion due 
to the motions of the Ho calibrators. 

In practice, however, the LG’s motion arises from grav¬ 
itational sources over a wide range of distances, so the 
true situation is more complicated than the two scenarios 
sketched above. Fortunately, by mapping out the distribu¬ 
tion of nearby galaxies with an all-sky redshift survey and 
predicting peculiar velocities via linear perturbation theory, 
we now have a good idea of the gravitational sources respon¬ 
sible for much of the LG’s motion (e.g. Erdogdu et al. 2006; 
Lavaux & Hudson 2011, Carrick et al 2014). Consequently, 
because in practice these surveyed volumes contain within 
them the secondary calibrators with which one is attempt¬ 
ing to measure the local Ho, the bias in the local value of 
Ho could be reduced by working in the frame of the red¬ 
shift survey itself. In other words, the solution is to use the 
predicted peculiar velicities to correct for the redshifts of 
the calibrators (Neill et al. 2007, Riess et al. 2011), leaving 
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only 150-200 km/s of peculiar velocity not well described by 
linear theory (Carrick et al. 2014). 


2.2 Galaxy Averaged Bias 

We now explore how the bias changes if, as is the case in 
reality, we average distances over galaxies (or supernovae), 
rather than perform an area weighted average on the sur¬ 
face of constant redshift, and allow for the fact that such 
sources are subject to selection bias. This involves weight¬ 
ing by volume elements of a shell that maps to a shell of 
constant thickness in redshift space, rather than by area on 
the surface of constant 2 , and this introduces a factor which 
is the Jacobian of the real- to redshift-space transformation. 
And there are additional weighting factors coming from the 
varying real-space density of galaxies arising from structure 
and from the distant dependent selection function. 

Consider a segment of a spherical shell in redshift space 
at redshift z and thickness dz that subtends a solid angle 
dn at the observer, and which therefore has volume dV z = 
dO,z 2 dz. This maps to a volume in real-space 

dV r = {l-v/z) 2 (l-dv/dz)dV z (9) 


where we see the Jacobian 1 — dv/dz. Unlike the tilt factor 
1 + |V±u| 2 /2, which was ignorable, this has a first order 
component. 

The expected number of detected galaxies is propor¬ 
tional to the product dV r (l + 8)</)(d) where <5 is the real-space 
galaxy density contrast and <j> is the selection function, which 
we can take to be a function of real distance d, since e.g. ef¬ 
fects from aberration caused by our motion changing area 
of galaxies is an order v/c effect and is relatively negligible. 

For v -C z we can make a first order expansion and 
write the latter as (j>{d) = />{z){ 1 — (v/z)d\n(f>/d\n z + ...). 
As before we shall only need to keep the first order term in 
the expansion of (1 — v/z) 2 , so we can use (1 — v / z) 2 4>{d) = 
4>(z){ 1 — (2 + 7 )v/z + ...) where 7 = dln<j>/dlnz. 

The average of the fractional perturbation to the dis¬ 
tance over a solid angle AS2, weighted by galaxy number, is 
then 

— _ / dQ( 1 - (2 + 7 )u/a)(l - dv/dz)(l + S)(-v/z) 

/ df 2 (l — (2 + y)v/z)(l — dv/dz){ 1 + 5) 


As before, we wish to evaluate 5d accurate to second 
order in perturbed quantities (now including 5 as well as 
velocity). Expanding and neglecting terms that are cubic or 
higher yields 


5 d = - 


r dn_ 
] An 


dn v ( (2 + 7)11 dv 

An z 1 2 dz ”*~ 


+ 


r dn' | 

((2 + -y)v' 

+ dv ' A\ 

/ An 1 

A 2 

dz )j 


( 11 ) 


Again, when we take the ensemble average we will as¬ 
sume that the first order terms vanish by symmetry. As al¬ 
ready noted the product of v and dv/dz should average to 
zero, as does the product of v and S. 

But we have extra 2nd order term involving the product 
oi v/z with S' and dv'/dz. For a statistically homogeneous 
random field the expectation of the field and its derivative 
at two different locations does not, in general, vanish, nor is 
(vS 1 ) = 0 in general. Generalising to an observable X whose 


perturbation has the expansion SX = av/z + bv 2 /z 2 + ... as 
before, the analogue of © is 


(JX> 


(v 2 ) 

(-(2 + 7 )a + b)^l 



(2 + 7 )(tu/) + z{dv'/dz 



( 12 ) 


On dimensional grounds, one might expect these new 
terms appearing in the double integral to have a large con¬ 
tribution (as compared to the term involving vv') from 
perturbations with wavelength A <§; 2 since both S' and 
dv'/dz ~ v/X. But that is misleading for the following rea¬ 
son. That part of the velocity field which derives from waves 
in the Fourier spectrum with wave-number k = 2n/X has a 
coherence scale of order A. So pairs of points that have signif¬ 
icant correlation are restricted to have separation ~ A, and 
if A -C z these pairs have a separation whose direction is 
nearly perpendicular to the line-of-sight. This actually sup¬ 
presses the contribution to (8d) from the vdv'/dz term to be 
smaller than that from the vv' term. The same is true for 
the term involving {v5'). Thus the differences introduced by 
averaging over galaxies, as opposed to the simpler averaging 
over areas, are small. 

As was the case of averaging weighting by area, if we 
average over the entire sky and assume that this covers many 
‘coherence-areas’, then we can ignore the double integral in 
m and we have, in analogy with ©, 

(fid.) = (2 + 7 )(u 2 )/2 2 

(J?) = -(1 + 2 7 )(u 2 )/2 2 (13) 

CD) = (5/ln(10))(3/2 + 7 )(u 2 )/ 2 2 . 

For distances of practical interest, the actual bias in¬ 
volves the additional terms in m■ But the simpler expres¬ 
sions above are potentially useful in a situation where large- 
scale motions have been modelled and corrected for, as they 
would then describe any residual bias caused by un-modelled 
motions on smaller scales. 

At any redshift the variable d £ is unbiased for n = —3 — 
27 ( 2 ). In terms of flux density 4? (and selection function <j>) 
this is $ 3 / 2 +din^/di nz At the distance at which the number 
of galaxies per logarithmic interval of distance is maximised 
- the distance where most of the galaxies reside, in some 
sense - the selection function is falling as 4> oc d~ 3 and so the 
unbiased variable is d£ oc (as compared to the <I > +3 ' /2 

that applies if there is no distance dependence selection). 


3 MALMQUIST BIAS FROM INCOHERENT 

SMALL-SCALE MOTIONS 

The foregoing analysis was somewhat restricted in that it 
was assumed that at each point in real-space there is a single 
velocity - i.e. that the galaxies move like a fluid, thus ruling 
out application to bound virialised systems where there are 
multiple streams - and yet more restrictive in that it was 
assumed that there was a unique velocity at each point in 
redshift space; which rules out e.g. ‘triple valued’ regions in 
redshift space that exist around clusters. These assumptions 
are reasonable only for large scale motions. 

At the other extreme, a useful and commonly used 
model for small scale motions within bound structures is 
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that these motions are spatially incoherent with peculiar 
velocities drawn from a distribution function P v (y)dv. As 
mentioned in the Introduction, the bias caused by small- 
scale motions with an assumed Maxwellian distribution (for 
which the distribution of the line-of-sight velocity is Gaus¬ 
sian) was considered by Lynden-Bell (1992) and by Willick 
et al. 1997, both of whom found an effect qualitatively simi¬ 
lar to, but twice as large as, the bias obtained from perturba¬ 
tion analysis (for motions with coherence scale less than the 
size of the averaging region). This is puzzling. Why would 
the result care about whether the coherence scale is just 
much smaller than the averaging cell size or microscopically 
small? 

We now show, at least in the limit that v z, that 
the result for (5p) in ([ 8 ]) applies also to small-scale inco¬ 
herent motions, and that the use of the standard formula 
for the bias with distance errors replaced by velocity errors, 
while entirely plausible, actually over-predicts the effect (by 
a factor 2 in the case that selection is ignorable). 

At any z, P(d\z) oc P(d, z) = P{z\d)P{d). But 2 = d+v, 
so P(z\d) = P v (z — d\d). If we assume that the distribution of 
peculiar velocities P v is position independent, then P(d\z) oc 
Pd{d)P v (z — d), from which we can compute expectation 
values for distance, distance modulus etc.. 

With 5p = a In (d/z) and assuming galaxies are uni¬ 
formly distributed in angle, but subject to some smoothly 
varying selection function rf)(d ), so Pd(d) = Pd(z)( 1 — (2 + 
7 )v/z + ...), the mean DM for galaxies at redshift 2 is 

/r , N (1-(2 + 7 )2) i>„(„) a + 

(P|Z> /du(l-(2 + 7 )f)P» 

(14) 

or, keeping only terms up to second order in velocity in the 
numerator and only the leading order term in the denomi¬ 
nator, 

(5u\z) = a(-(v)/z + ( 7 + 3/2 ){v 2 )/z 2 ). (15) 

For the assumed centred Gaussian distribution, (v) = 0 
and m agrees with the the third of m and, ignoring 
selection (i.e. setting 7 = 0 ), we have 

(6fj,\z) = (7.5/ ln( 10 ))(u 2 )/ 2 2 (16) 

in accord with the third of © but in conflict with equation 
15 of Willick et al. 1997 and at odds both with equation 9.17 
of Lynden-Bell (1992) and with the seemingly reasonable 
analogy with Lynden-Bell et al. 1988, all of which would 
suggest that for a uniform spatial distribution of galaxies 
dlnd = 3 (v 2 )/z 2 , which is twice as large as what we have 
here. 

The reconciliation with Lynden-Bell (1992) is that the 
quantity he considers is the mode of P(ln(d)|u) the distri¬ 
bution of log-distances given an observed recession velocity 
v and assuming a Gaussian scatter in v. That is the most 
probable log-distance. But what we are interested in here is 
the mean of the log-distance. The In d probability distribu¬ 
tion, under these conditions, is asymmetric, and the shift of 
the mean is half the shift of the mode. Using the shift of 
the mode, we would argue, overestimates the ‘homogeneous 
Malmquist bias’ caused by small scale velocity dispersion by 
a factor two. 

Regarding the analogy with Lynden-Bell et al. 1988, 
what they assumed was a model for FP distance errors 


in which the probability distribution for the estimated log- 
distance l e given a true log-distance l = lnd was a Gaussian: 

P{le\l) = ( 2 ttA 2 )- 1/2 exp(—(l e - l) 7(2A 2 )). (17) 

In the present context redshift 2 plays the role of estimated 
distance, with v the distance error. But the model (1171) dif¬ 
fers from that assumed above (with a Gaussian distribution 
for velocity errors) in two respects: First, this distribution 
implies an asymmetric distribution for the peculiar veloc¬ 
ity, with a non-zero mean and asymmetric tails. Second, the 
fractional distance error is independent of distance, so in 
this model the absolute error grows with distance. This is 
appropriate for TF or FP distances, but not for errors pro¬ 
duced by random motions. As we show in the appendix, the 
former does not, by itself, resolve the inconsistency; if one 
uses the moments of v implied by this distribution in (1151) 
this gives <5/r = (5/ In 10)A 2 , which does not agree with (1161) 
nor, for that matter, is it in accord with dlnd = 3A 2 . The 
full resolution, again demonstrated in the appendix, is that 
one needs to modify the above argument to treat the case 
of distance independent fractional distance errors, and the 
bias is then given by (IA3I) which is very similar to (1151) but 
which has the numerical factor 7 + 3/2 replaced by 7/2. Us¬ 
ing the first and second velocity moments implied by GH) in 
(I A 31) gives (dp\z) = (15/ln(10))(u 2 }/2 2 in accord with the 
usual formula <5lnd = 3 (v 2 )/z 2 . But this is not correct for 
distance errors from velocities, which is what we are con¬ 
sidering here, where it is the absolute rather than fractional 
distance error that is independent of distance, and where 
the velocity distribution is symmetric. 

The above argument is idealised in that it assumes both 
the density of galaxies and the velocity distribution function 
to be independent of position. Regarding the homogeneous 
Malmquist bias the effect of relaxing this is that the expec¬ 
tation of the sky-averaged Sfi involves the galaxy weighted 
velocity variance. For large-scale density perturbations there 
is also an inhomogeneous Malmquist bias term (whose ex¬ 
pectation vanishes), just as found by Lynden-Bell (1992). In 
this regard, we note that the variable <j> 3 / 2 + dln, » i / dlnz ; s on iy 
unbiased with respect to the homogeneous Malmquist bias 
and is still affected by the inhomogeneous Malmquist bias. 


4 SUMMARY 

We have shown in SJ2] that the relatively large low-redshift 
perturbations to the mean distances, flux densities or dis¬ 
tance moduli obtained from relativistic second order per¬ 
turbation theory can be understood as a purely classical 
kinematic and statistical Malmquist-like effect and are not, 
in any essential way, a manifestation of non-linear dynam¬ 
ics. While gravity is involved in generating peculiar veloc¬ 
ities, precisely the same bias would be found if one were 
observing test particles with non-gravitationally generated 
motions. The relativistic treatment may contain other ef¬ 
fects that are essentially gravitational in nature, but as they 
are apparently extremely small they are of limited interest. 

In H2.2I we generalised the analysis to obtain the bias 
when, as in reality, the distance is averaged over sources 
such as galaxies or supernovae that are subject to selection 
bias. 

Our analysis provides formulae that could, in principle, 
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be used to correct for biases in distance, and hence in the 
‘local’ value of Ho, from large-scale or small-scale motions. 
For the former, our results properly account for covariance 
and suppression of the effect of super-survey modes that is 
missing from the relativistic perturbation theory papers. But 
we emphasise that the effects on Ho at least are very small 
and much smaller than the fluctuations in measurements of 
Ho that arise in linear theory. 

We have shown in [J3]that small-scale incoherent veloci¬ 
ties have essentially the same effect. They do not cause a per¬ 
turbation to log-distance <51nd = 3<r^/c 2 a 2 as has previously 
been found, and as would seem reasonable by analogy with 
the commonly used formula for homogeneous Malmquist 
bias. The effect is a factor two smaller. The reason that 
the standard formula is not valid for bias from velocity dis¬ 
persion is in part because the model implies an unrealistic 
distribution of velocities and in part because it assumes that 
the distance errors scale linearly with distance whereas er¬ 
rors from motions are distance independent. 

We showed that the average of $ 3 / 2 + dln 4 , / dlnr ' does not 
suffer velocity dispersion induced homogeneous Malmquist 
bias. 

We provide in appendix 0 a slightly generalised for¬ 
mula for the homogeneous Malmquist bias produced by er¬ 
rors in estimated luminosity distance - as from e.g. Tully- 
Fisher or fundamental plane techniques - when using the 
‘forward’ method. This result is only valid for A 2 1, but 
makes no assumption about the form for the distribution 
function for the distance errors. 
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APPENDIX A: MALMQUIST BIAS FROM 
LUMINOSITY DISTANCE ERRORS 

We now obtain the analogue of m for the situation where 
distances are estimated from the source flux density, rather 
than redshift, and the distance error scales linearly with dis¬ 
tance as is appropriate, to a first approximation, for TF or 
FP distances or for supernovae. Thus we assume sources 
with real distances d and estimated distances 3 and distance 
error z — d — v. I.e. we use the same notation as before, but 
with a different interpretation as the the cause of the errors, 
and to obtain a distance independent distribution for frac¬ 
tional errors we take P v (v\d)dv = f (v / d)dv/d where f(y) is 
some normalised bell-shaped function: f dy f(y) = 1. 

If we assume the fractional distance errors are small 
v -C d, we have 



where f'(y) = df/dy. 

Our goal is to compute (d\z) from the conditional dis¬ 
tribution of distance P(d\z) oc P(z\d)P(d). Previously we 
used P{d) = d?(j){d), but here the estimated distance is not 
the redshift, it is the inverse square root of the flux density, 
so a magnitude limit imposes a selection that is a function 
of the estimated distance (2 in our notation). The upshot, as 
explained by Strauss & Willick (1995), is that the selection 
function drops out when we compute (d\z) or, equivalently, 
the bias is the same as obtained without any selection. 

Using (03 in m we find two extra terms that produce 
significant contributions to the numerator (when multiplied 
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by —v/z and integrated): 



where we have integrated by parts and assumed that / falls 
to zero for large argument sufficiently fast that the boundary 
term is negligible. 

As in CG3> one finds, in additional to a term proportional 
to the variance in the distance error, the mean distance er¬ 
ror — (v). This is not necessarily zero - it is not zero, for 
instance, if the distance estimator is obtained by minimis¬ 
ing residuals in magnitude - but it is reasonable to assume 
that the strength of any bias in ( v) is, to order of magni¬ 
tude, at most proportional to the variance ( v 2 ). Keeping 
terms up to quadratic order in the distance error v and ig¬ 
noring the sub-dominant terms in the denominator in © 
yields the general result valid up to linear order in the frac¬ 
tional distance variance A 2 = ( v 2 )/d 2 for the homogeneous 
Malmquist bias 

(dfj,\z) = a(—(v)/z + 7/2 (v 2 )/z 2 ) (A3) 

where v is minus the distance error and 2 can be taken to be 
either the estimated distance 2 or the real distance d = z—v. 
One can also obtain the perturbation to any other variables. 
The perturbation to the distance, for instance, is 

(Sd.) = ~{v)/z + 4 (v 2 )/z 2 . (A4) 

As a check, we can apply <lA3l1 and (lA4l) to the log¬ 
normal model of Lynden-Bell et al. 1988 that is known to 
give exactly dlnd = 3A 2 (or ( dfj\z) = (15/In 10)A 2 ). In 
this model the probability distribution for the estimated log- 
distance l e given a true log-distance l = lnd is a Gaussian: 

P(l e \l) = (27rA 2 r 1/2 exp( (Z e - Z) 2 /(2A 2 )). (A5) 

With d = e and 2 = e e the moments of the estimated 
distance distribution are ( 2 n ) = d n exp(n 2 A 2 /2). The first 
moment is ( 2 ) = d(l + A 2 /2 + ...), so the mean of the 
distance error is ( v) = dA 2 /2 + ..., which is non-zero, and 
the second moment is ( 2 2 ) = d 2 (l + 2A 2 + .. .) so the distance 
error variance is (v 2 ) = d 2 A 2 + ..., where the notation ... 
indicates quantities that are of higher order in the assumed 
small logarithmic variance A 2 . Using these in (lA3l) gives 
(d[i\z) = (15/ In 10) A 2 in agreement with Lynden-Bell et al. 
1988, while (1 A 41) gives 8d = (7/2)A 2 , in accord with equation 
185 of Strauss & Willick (1995). 

Equations (T£3l) and (IA41) provide a generalisation of the 
standard results in that they do not assume a perfectly log¬ 
normal distribution, though they are limited to the regime 
where the fractional variance A 2 <g 1. They apply only to 
the ‘forward’ method where one averages the peculiar ve¬ 
locity of objects as a function of estimated distance. The 
more popular ‘inverse’ methods do not suffer this bias. In¬ 
stead they have the much smaller residual bias from random 
motions causing scatter in distance that is the focus of the 
main part of this paper. 
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